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ANTIMICROBIAL PROTEINS FROM THE SPOl BACTERIOPHAGE 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This Application claims priority to provisional application 60/457287 filed on 

STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH OR DEVELOPMENT 

[02] The present invention may have been developed with funds from the United 
States Government. Therefore, the United States Government may have certain rights in the invention. 

REFERENCE TO A COMPACT DISK APPENDIX 

A Sequence Listing, including SEQ ID NO: 1 to 24, is submitted with this 

FIELD OF THE INVENTION 

[04] The invention relates to proteins from the SPOl bacteriophage, genes encoding 
same, and their use in arresting bacterial metabolism. Further, the present invention relates to a method 
of treating infected patients through the introduction of proteins, peptide fragments of the proteins or 
peptidomimetics, or genes encoding same into the bacteria infecting the patient. 

BACKGROUND OF THE INVENTION 

[05] Viruses are obligate parasites that use the host's mechanisms for expressing their 
genes and propagating themselves. Bacteriophage, or "phage" for short, are viruses that infect bacteria. 
Phage have a host range that is often fairly narrow, infecting a single species or a group of related 
species. 
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[06] Some bacteriophage, such as T4, always kill their hosts within a short time after 
infection-they are called LYTIC phages. Others, such as Lambda, can also establish themselves in a 
dormant form within the cell and be maintained stably over many generations-they are called 
LYSOGENIC phages. On infection lysogenic phages have a choice between a lytic cycle or to establish 
lysogeny. 

[07] In a lysogenic cycle, the phage integrates into the host chromosome at a specific 
point using a site-specific recombination process. The expression of most phage genes is shut down by 
a repressor and the phage genome can be replicated as part of the host chromosome. Under certain 
conditions, usually involving host stress, the genes get switched on again and the phage genome is 
excised from the chromosome, replicated and new phage particles are released by lysing the cell. 

[08] After their discovery early in the 20th century, phage were widely used to treat 
various bacterial diseases in people and animals. After an enthusiastic beginning, poorly designed 
research protocols and the development of antibiotics ended most phage therapy research in the West. 
However, phage therapy continued in Poland and the former Soviet Union for decades and the Eastern 
European experience should serve to bootstrap the technology into the U.S. Thus, although there are 
few, if any, clinical trials underway in the U.S., the technology is reasonably well developed and its 
advantages and limitations are well understood. 

[09] The alarming rise in antibiotic resistance in bacteria is leading to a resurgence of 
interest in bacteriophage research. In fact several companies exist that are dedicated to the realization of 
phage therapy. The Eliava Institute of Bacteriophage, Microbiology & Virology in the Republic of 
Georgia has been an international leader in phage therapy for over 70 years and has the world's largest 
collection of therapeutic phages. Other companies including Exponential Biotherapies, Inc. of New 
York; Intralytix, Inc. in Maryland; Phage Therapeutics International, Inc. (PhageTx) of British 
Columbia; Biophage, Inc. of Quebec; and PhageTech, Inc. of Quebec are developing commercial phage 
therapies. 

[10] The limitations and advantages of phage therapy are listed by Elizabeth Kutter of 
Evergreen State College at http://www.evergreen.edu/phage/phagetherapy.html. Her lists are worth 
iterating here in order to understand the potential and limitations of phage therapy. The limitations 
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include: 1) Paucity of understanding of the Heterogeneity and ecology of both the phages and the 
bacteria involved; 2) Failure to select phages of high virulence against the target bacteria before using 
them in patients; 3) Use of single phages in infections that involve mixtures of different bacteria; 4) 
Emergence of resistant bacterial strains, which can occur by selection of resistant mutants (a frequent 
occurrence if only one phage strain is used against a particular bacterium) or by lysogenization (if 
temperate phages are used); 5) Failure to appropriately characterize or titer phage preparations, some of 
which were totally inactive; 6) Failure to neutralize gastric pH prior to oral phage administration; 7) 
Inactivation of phages by both specific and nonspecific factors in body fluids; 8) Liberation of 
endotoxins as a consequence of widespread lysis of bacteria in the body (this is called the Herxheimer 
reaction); and 9) Failure to identify the bacterial pathogens involved necessitated by the relative 
specificity of phage therapy. 

[11] The advantages of phage therapy include: 1) They are self-replicating and self- 
limiting, because they multiply only as long as sensitive bacteria are present and then are gradually 
eliminated; 2) They target specific bacteria, causing less damage to the normal microbial balance in the 
body; 3) Phage can often be targeted to receptors on the bacterial surface that are involved in 
pathogenesis, so that any resistant mutants are attenuated in virulence; 4) Few side effects have been 
reported for phage therapy; 5) Phage therapy would be particularly useful for people with allergies to 
antibiotics; 6) Appropriately selected phages can easily be used to prevent bacterial disease in people or 
animals at times of exposure, or to sanitize hospitals and help protect against hospital-acquired 
infections; 7) Phage can be prepared fairly inexpensively and locally, facilitating their potential 
applications to underserved populations; 8) Phage can be used independently or in conjunction with 
other antibiotics to help reduce the development of bacterial resistance; and 9) Multiple delivery means 
are available, including noninvasive means like topical application, oral administration, and inhalation. 

[12] One lytic phage of long time interest to our laboratory is called "SPOl." SPOl 
has a linear dsDNA genome of 140 kb, and its host is the bacterium Bacillus subtilis. Early in SPOl 
infection of B. subtilis, the synthesis of most host-specific molecules is replaced by the corresponding 
phage-specific biosyntheses. Subversion of the host machinery is accomplished primarily by a cluster 
of early genes in the SPOl terminal repeat in an 1 1.5-kb "host-takeover module." The module includes 
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24 genes, tightly packed into 12 operons driven by the previously identified early promoters PE1 to 
PE12. 

[13] The 24 genes are smaller than average, with half of them having fewer than 100 
codons. Most of their inferred products show little similarity to known proteins, although zinc finger, 
trans-membrane, and RNA polymerase-binding domains were identified therein. Transcription- 
termination and RNase III cleavage sites were identified in the nucleotide sequence as well. We have 
placed most of these 24 genes into an inducible B. subtilislE. coli shuttle vector and introduce them here 
for use in phage therapy and as antimicrobial agents. 

BRIEF SUMMARY OF THE INVENTION 

[14] As used herein "tightly regulated inducible promoter 1 ' is an inducible promoter 
* that does not express detectable levels of protein in the absence of the inducing agent. Detectable levels 
of protein expression could be indicated by inhibition of bacterial host cell growth. In other words, the 
promoter is not "leaky" to a degree sufficient to prevent cloning of the gene in the host cell. 

[15] As used herein "recombinant" is relating to, derived from, or containing 
genetically engineered material. 

[16] "% identity" is calculated over the entire length of a specified sequence, thus short 
local alignments with a claimed sequence are not relevant (e.g., % identity = number of aligned 
residues/length of reference sequence). Alignments are performed using BLAST (Basic Local 
Alignment Search Tool) homology alignment as described by Tatusova and Madden (1999). The 
default parameters are used, except the filters are turned OFF. As of Jan. 1, 2001 the default parameters 
were as follows: BLASTN or BLASTP as appropriate; Matrix = none for BLASTN, BLOSUM62 for 
BLASTP; G Cost to open gap default = 5 for nucleotides, 1 1 for proteins; E Cost to extend gap [Integer] 
default = 2 for nucleotides, 1 for proteins; q Penalty for nucleotide mismatch [Integer] default = -3; r 
reward for nucleotide match [Integer] default = 1; e expect value [Real] default = 10; W wordsize 
[Integer] default = 1 1 for nucleotides, 3 for proteins; y Dropoff (X) for blast extensions in bits (default if 
zero) default = 20 for BLASTN, 7 for other programs; X dropoff value for gapped alignment (in bits) 30 
for BLASTN, 15 for other programs; Z final X dropoff value for gapped alignment (in bits) 50 for 
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BLASTN, 25 for other programs. This program is available online at 

http://www.ncbi.nlm.nih.gov/BLAST/. 

[17] The invention provides expression vectors encoding a protein comprising a 
sequence selected from the group consisting of SEQ ID NO: 1-24, wherein the expression of the protein 
is controlled by an inducible promoter. The transcription must be under sufficiently tight control so as 
to allow the vector to be propagated in the host cell. 

[18] The invention also provides a method of designing peptidomimetic anti-bacterial 
drugs comprising preparing proteins of SEQ ID NO: 1 to 24 or variants with at' least 95% identity over 
the entire length of the specified sequence by recombinant or synthetic means (e.g., peptide synthesis), 
determining their three-dimensional structures, and using the three-dimensional structure as the basis for 
designing peptidomimetic small molecules to be tested for anti-bacterial activity. 

[19] Methods of treating a bacterial infection in a warm blooded animal in need 
thereof are also provided. The methods require the direct in vivo delivery of an antimicrobial protein as 
described above, in an amount effective to kill bacteria, or delivery of a peptidomimetic small molecule 
having the bactericidal activity of the protein. Alternatively, the protein can be indirectly delivered by a 
bacteriophage that encodes said protein. Beneficially, the proteins of the invention do not lyse the 
bacteria, and are of particular use where an infection is halted without cell lysis and the concomitant 
release of bacterial toxins. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[20] FIG. 1. Shutoff of Host RNA Synthesis. At various times after infection at 30°C, 
cultures were pulse-labeled with [5- 3 H] uridine. Single mutants 50- and 51- had no significant effect 
(data not shown), showing that both gene products, GP50 and GP51, contribute to the activity of 
GP50/51 ("GP" refers to "gene product" throughout). Each data point in such an experiment represents 
the sum of host and phage RNA synthesis. 

[21] FIG. 2. Shutoff of Host DNA Synthesis. At various times after infection at 30°C, 
cultures were pulse-labeled with [methyl- 3 H] thymidine. Only host DNA synthesis was assayed in such 
an experiment, since SPOl incorporates hmUra into its DNA in place of thymine. 
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[22] FIG. 3. Shutoff of Host RNA Synthesis. Host and phage RNA synthesis were 
monitored as described for 39-, 40-, and 39-40- SPOl. 

[23] FIG. 4. Shutoff of Host DNA Synthesis. Host DNA synthesis was monitored in 
39-, 40-, and 39-40- SPOl. 

[24] FIG. 5. Gene-Specific Expression Patterns. Cultures infected with wild-type 
SPOl, mutant 28-, or triple mutant 44-50-51- were harvested at 5, 15, or 25 minutes after infection at 
30°C. Triangles represent wild-type; squares represent the 28- mutant; and circles the triple mutant 44- 

50- 51-. 

[25] FIG. 6. Differential Effects of Mutations on Immediate-Early, Delayed-Early, 
and Middle Transcription. Cultures were infected with wild-type SPOl, mutant 44-, double mutant 50- 

51- , or triple mutant 44-50-51-. The numbers at the top represent the gene or genes present in that spot 
on the dot blot. The gene categories are indicated across the top. Gene 27 falls under both middle and 
delayed-early categories. 

[26] FIG. 7A. Distinguishable Effects of Genes 50 and 51 in the presence of the 44- 
mutation. Rates of RNA synthesis were measured by pulse-labeling with [5- J H]uridine. 

[27] FIG. 7B. Distinguishable Effects of Genes 50 and 51 in the absence of the 44- 
mutation. Rates of RNA synthesis were measured by pulse-labeling with [5- ? H]uridine. 

[28] FIG. 8. Expression of SPOl GP44 in E. coli. Cells of E. coli strain 
DH5(pPW27), which expresses SPOl gene 44 from an IPTG-inducible promoter, were transformed 
either with vector pACYC177 or with plasmid pPW40, in which the mutant lpoB gene had been cloned 
in pACYC177. Aliquots of the two cultures were induced by adding 0.2 mM IPTG at time 0, and were 
assayed for viable cells at various times thereafter, by plating on media lacking IPTG. Open symbols, 
no IPTG; Closed symbols, 0.2 mM IPTG added at time 0; Squares, mutant rpoB; and Circles, no mutant 
rpoB. Viable cell count is expressed in cells/ml X 10" 6 



[29] FIG. 9. Effect of Gene 40, Gene 44, Gene 50 and Gene 51 on Phage DNA 

Synthesis. 
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[30] FIG. 10. Effect of Gene 44, Gene 50 and Gene 51 on Single Step Growth Curves. 

[31] FIG. 1 1 . Effect of Gene 39 and Gene 40 on Single Step Growth Curves. 

[32] FIG. 12. Effect ofGP50 on Host DNA Synthesis. 

DETAILED DESCRIPTION OF THE INVENTION 

[33] The early gene region of the terminal redundancy of SPOl is found at GenBank 
Acc. No. AF031901. It is a linear 11500 bp dsDNA, where 5-hydroxymethyluracil (hmUra) is used 
instead of thymine as the base-pairing partner for adenine, and T's in the GenBank sequence represent 
hmUra. There are 24 small proteins encoded therein. The 24 genes are arranged in 12 operons, each of 
which includes between one and 4 genes. Genes 51 and 50 constitute one operon; gene 44 is the only 
gene in its operon; genes 37-40 are all in one operon. Genes 37-49 are transcribed rightward and 60-50 
are transcribed leftward, which is why the latter are expressed in reverse numerical order. The genes 
and their encoded proteins are listed in Table 1. 



Table 1 - Anti-bacterial proteins from SPOl 


Gene 
Name 


SEQ 

ID 

NO: 


Peptide Sequence 


37 


1 


MSDVI IPFLTSAVTAFIVAYLLDRWYIKRRR 


38 


2 


MSVQIKHGNKTFWDPSGDVKEGSYVLYLYEYRLGEVDVGRVSEVANDGRLYLDGPGVIVTLDQ 
PFILLKEWEEEEDEDDRIDAEFHNDPLLRKLENTTEKLTPEETQLAQWTTMTRVFSHDLKKGI 
PYAIKHKNSGNILYGLYSGLLNPVTALFRHLNEESKISIEQLKSGLIEIYEWEDEEESIWN 


39 


3 


MELNLD I YVD YKDKR YKAEG YYGPS VGDLVL I FMDMELEGATVQEVAR I EGS E IHLRTPNGNE P 
SYRYMGQYLILKPYGSSDPRGDILVHEDVQYVRVDAQAMPGDLIEALEPNKLPFSGKRFKYRPA 
VLEVEYVLTKDEQVLQLENGKSYSGAYRVLIPRMGVLPPKTHIYTTHKHVFMEDVFVLGNSYEL 
S S PNDVEMTP IHAVFTGF S KNRDEAI FVNP YYNDDGVTGTM I T VSDLLTGKWD I TPLVPKKGV 


40 


4 


MHIYTYWGLKYVPSNSTWAKEGDLILLGNEVHKWK^HRFRNITDLQITNWKGTETRYNLW 
TEYKVLVPYDTHKEENEAMSDSLITHNGKDYVLCKIPARVGDLIRTEDKRWEVLQKSKDGLVL 
YNEEKGEQRSAVYSEIGPYHVLVPRDTDTHTPTREELAAVIMNKAFTRTETQDSQEDTGTHKGL 
GLTGTDLYHSLRDLDAKVQSGYYTATENEEDVKSEIEATKKHMKAVKESGKTVNDYRKEENTKR 
CKLKALTNKFNRLFLKSVIDTDSLQVGKAYLIGGRDMK2^GLYTGTTFDQQHANFLIVETDRM 

HRTLTVS AEQLFAEERHI VD I EKRVEQTED 


41 


5 


MEKLPNTWKVHGEGMESKLFPRKLHKDTNSILREDLVSACQEHIEALV^GMIAHGDGRKVAEL 
DTSTQYYWHLKLVEYTPIPGRTQHYVDLVDGTNPDVCYFSLCDCSGDNITDRRWSNMVKRLQNP 

EEDIAKTLRCYFRQDAGMPSWIEYPQ 


42 


6 


MRKFVTTLTASPRNKKVGNHRLE I S PFVSLRRYYYFNTAI C I ENP VTRE FA I DDS YGSLSTNQN 
CAQYRQYFSLGGYKEVSLEEIHAV 


43 


7 


MIQLSERQQDLLQVAEKYEQCHIEFYTAQSRLFGTEIMGEWKTSLGTLKIAHPEEDLFEVALA 
YL AS KKD I LTAQERKD VLF Y I QNNLC 


44 


8 


MAKSNNVYVWGEEKVSTLAEVAKVLGVSRVSKKDVEEGKYDVV^ 
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Table 1 - Anti-bacterial proteins from SPOl 



Gene 
Name 


SEQ 
ID 

NO: 


Peptide sequence 






EEEDILEGVEWEDEEEEEAAEDVEEPTSEEDSEDEWEEGYPVATEVEEDEDEEIEYPEVGDFE 
DEKAIKKYIKGLTDEQLQAWCELEGAEWVENEHRNIN i J ^j_iR.iVi\f^>^rvi\rvoiv 
YAEYTTEELVEMAIDNNVEVRDDKGNERILRMYTIIALREAGLIS 


45 


9 


MMMDKQVEE VKKHYP I VEDWS VI VARKEDDCMTVTDA V PF I LAG Y KJU V b x i^miju x vvbLbfif J.u 
t Twpmn? FT .TTKTHFn^ VS FEE IGYEDKAMVYHVDLG 


46 


10 


MMTEDQKFKYLTKIEELEAGCFSDWTKEDITGDLKYLKKGIIEESIELIRAVNGLTYSEELHDF 
TQE 1 1 EELD I S PL 


47 


11 


MTM»7^™rrT?Mr ,r r^mTr\7KF T WMCJT ,FF AGRL YAVWLSDDHVYG I VDVNEEGL FCLGWVSD I S PES LQ 
NMLGGGAELFES YEDVLSEHGGS IAIRVEV 


48 


12 


MPYSKITVPVLVGEGLTEWDVIDVMRETHPPTVEDQYHYHTFDSMQNRTIFVLENPLYPDVDKI 
nr-v^jT r-T7\\mAT FnMT.DNVPVFDLPVTEEOGMVKRFTTKLASIVFDVFLI IPDFVSVTAKEE I 


49 


13 


M I KAAVTKES LYRMNTLMEAFQGFLGLDLGEFTFKVKPGVFLLTDVKS YLIGDKYDDAFNAL ID 
FVLRNDRDAVEGTETDVSIRLGLSPSDMWKRQDKTFTFTHGDLEFEVHWINL 


50 


14 


MDKLAAGGLYLLFLLLAGI I VTH 


51 


15 


MAKINKGYVANFIEENGFPEQGHFEEKKDLQAFYKHLSTEQLEEWVELEGLEVKDTDSDSIYRM 
i-.t ™*7v t t vt TvTT?T5x^v^7\r'T^T^v r z\QOYKMT^T.PFT,VOMATDNDIEVKHTDSDKILRMRTIiyL^ 

KT»G 


Ji. 




MTHF I S I ATYI YALVSAGF IGGWHDEESWI KDTEYEHGGYHMI IDTPAWNYSLE YGNYQWI FQ 
KYMKEGKVTVERF YRNS LD IPKEI LTDEALAF I KDWDENANE YELHAGEGVL YFKYEGE EKG YV 
I PMAYAGE IMFVPDEDAEKALE I INSQKKY 


53 


1 ""7 

17 


MRTYWNVSLDRSNGKRFERLVHYICVPIISIHHAEDTISMTRKEVGHLAETIANHIILDINGTY 
RTFSVNDIVHCSLEKVITLEGDVTNEFIDRLQILVNKEVQGSQSTQQSLSSVFESTLEKYNSPD 
DFADYLEETEEEVDYEDYSLDDTIDAISYALKTQEPVQAEWCLLMVDVYTGTLTEVTVETDKDK 

TLDSILGKYLENGFECVSKKRLGEVL 


54 


18 


MVI I KYTTKTQ PTPVKEMF I S PQH YAKWRSHMGS KLTS VKP I KGGR 


55 


19 


MFKLLTLFKRNKITSAEEYYTQAIHICEQFDRSTQKYTSM 


56 


20 


MFKYTDRSVRQYIERQQRSAMLEQEQAEKDKKERRKAGLLFFGTIVVLVAWAVYIVPQSLDAM 
WHENYEKPAQEAARN 


57 


21 


MTLFIAGVTLEEVREATVSALFVKLEQEKKALYLGAGSEDSLNLCKSTLDKVQEDYPLDDMEKD 
YLRDLLQFWLSRLFLGDGFEGEIPDSSEDLRRTATTAFTYTAAIRHYCM 


58 


22 


MTLAGYRVDSCNGCGKAYLVGESHDRKKCAECASK 


59 


23 


MKKRYKVTALFEDGTSQCLWGNFSSPTNAWCAAMRNLTPEGIARVQHYNVEEISK 


60 


24 


LNQVEVLREEYVEGYWQMWRRNPSNAPVIEVFTEDNLEEGIIPEYVTANDDTFDRIVDAVEFG 
YLEELELV 



[34] Single Step Growth Experiment: Cells of CB10 were infected at an MOI of 
about 0.1 at 30°C. The infected cultures were diluted through anti-SPOl antibody and plated on CB313 
lawns at 25 minute intervals. Plaque forming units (Pm) were counted and plotted against time. 



[35] SDS-PAGE: Cultures of CB10 were infected with appropriate strains of SPOl at 
an MOI of 5, and shaken at 30°C. At various times, aliquots were pulse labeled for 2 minutes with ( 35 S) 
Met-Cys. Extracted proteins were subjected to SDS-PAGE, and autoradiographs were prepared. 
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[36] RNA preparation: Cultures in VY medium were harvested at the indicated times 
after infection, as described previously (Wei and Stewart, 1993). The RNA was purified using the 
QlAGEN® RNeasy® Mini Kit, following the manufacturer's protocol. 

[37] Dot blots: PCR products carrying a particular gene or genes were denatured by 
incubating for 15 minutes at 37°C in 0.2M NaOH. 10 ng of DNA in 2 jxl were spotted onto positively 
charged nylon membranes, AMERSHAM® HYBOND™ N+, and bound by UV crosslinking. Each 1 1 
cm X 5 cm membrane held an array of about 23 spots. The PCR fragments included all, or nearly all, of 
the specified gene or genes, except for gene 31 which included only the first exon. 

[38] Hybridization: For each RNA preparation, a 1.0 |ig aliquot was labeled with 
alkaline phosphatase, using the AMERSHAM® AlkPhos™ Direct Labeling and Detection Kit. The 
entire labeled preparation was hybridized to one membrane carrying a dot blot array, using the 
hybridization buffer defined in the kit. Hybridization was at 60°C for two hours in a Rob BINS® Model 
400 hybridization incubator rotating at about 4 rpm. Wash buffers were as defined in the kit, with the 
primary wash at 65°C and the secondary wash at room temperature. Detection of hybridized RNA used 
the Amersham® CDP-Star™ Chemiluminescent Detection Reagent, following procedures from the 
kit. The alkaline phosphatase catalyses decomposition of dioxetane with emission of light, which is 
detected on film, using Hyperfilm ECU™ from Amersham®. All hybridizations were done under the 
same conditions, without attempting to identify optimal conditions for each gene, and we have not made 
independent measurements of the concentrations of the various transcripts or of their intrinsic 
hybridization efficiencies. Thus, gene-to-gene differences in the absolute amount of signal detected 
could be due to differences in intrinsic hybridization or labeling efficiency. Conclusions were drawn 
only from comparisons between signal intensities for the same gene under different conditions. 

[39] Densitometry: The intensities of the dot-blot signals were measured with a 
Molecular Dynamics™ computing densitometer. Positions at which no spot was visible on the film 
gave intensity values ranging from -20 to +20. Some spots that were faintly visible gave intensity 
readings less than 20, which are reported as measured, with the recognition that they represent only 
approximations of the actual value. Spots not visible on film are reported as 0. Intensity values as high 
as 1700 were within the linear dose-response range. Where undiluted samples gave values beyond the 
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linear range, we measured hybridization by diluted samples, and corrected for the dilution factor and the 
experiment-to-experiment difference in hybridization efficiency 

[40] Pulse Labeling: Rates of synthesis of host and phage RNA or protein were 
measured in C4 medium as described previously (Wei and Stewart, 1993), by pulse-labeling for 5 
minutes with [5- 3 H]uridine, or [4 ? 5-" > H]leucine, precipitating with TCA, and counting the precipitate. 
Since SPOl DNA contains hmUra in place of thymine, host DNA synthesis could be measured 
independently by pulsing with [methyl- 3 H]thymidine. Phage DNA synthesis was measured by pulsing 
with [2,8- 3 H]adenine, and measuring alkali -stable, TCA-precipitable counts. Pulsing with [2,8- 
3 H]adenine measures host as well as phage DNA synthesis, but most host DNA synthesis has been shut 
off by the time that phage replication begins, even in mutant infections, so incorporation of label after 
12 minutes is overwhelmingly into phage DNA. The time given for each pulse is the time at which the 
pulse began. 

[41] Mutagenesis: Cloned genes were mutated using the Stratagene© 
QuikChange® kit. In each case, a nonsense codon was substituted for the first or second lysine codon 
(or for the fi rst leucine codon in the case of gene 37) (codons 9, 10, 14, 10, 3, 1 1, 7, 3, 3, and 3 for genes 
37, 38, 39, 40, 44, 45, 46, 50, 51 and 56, respectively), and the mutants were propagated on a suppressor 
strain (CB313) which inserts lysine at nonsense codons. Each mutagenized plasmid was allowed to 
recombine with superinfecting SPOl. Recombinant progeny were identified by plaque-lift 
hybridization, using the AmershaiM ECL™ 3'-oligo labeling and detection kit, and allowed to segregate 
pure mutant strains. Multiple mutants were produced by successive recombination with the appropriate 
mutagenized plasmids and all genotypes were confirmed by DNA sequencing. The first 8 nucleotides of 
gene 50 overlap the last 8 nucleotides of gene 51, and the gene 50 mutation destroys the termination 
codon of gene 51, apparently permitting translation of gene 51 to run on for about 30 more amino acids, 
beyond its normal 151. Thus, definitive conclusions about the role of GP50 could not be made from 
single mutants in which only gene 50 was mutated. The gene 28 mutant was derived from mutant F21, 
initially described by Okubo et al (1972), by 5 successive back crosses to wild-type SPOl. We have 
previously called it sus28-l and our lab bookkeeping lists it as F21(5). 
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[42] Identification of Immediate-Early, Delayed-Early, and Middle Genes: 

Expression of specific genes was measured by dot blot hybridization and densitometry. FIG. 5 shows 
the expression pattern of the known early genes and several representative middle genes, in the wild- 
type or when altered by either the 28- mutation or the 44-50-51- triple mutant. The 28- mutation 
inactivates the middle gene-specific sigma, permitting identification of middle gene activity. Note that 
different genes are expressed on different scales, as the absolute level of expression varies widely from 
gene to gene. The densitometric values for each gene are plotted as a function of time after infection. In 
each graph, the number in the upper left corner indicates the gene whose activity is plotted. Where an 
operon of more than one gene was assayed, the graph is labeled with the number of the first gene in the 
operon. This applies to operons 37-39, 45-46, 48-49, 51-50, 55-53, 58-56, and 60-59. To a first 
approximation, the immediate-early genes have been arranged from top to bottom in order of increasing 
activity. Gene 27 could have been displayed with either the delayed-early or the middle genes. 

[43] Genes 31 and TF1 were previously known to be middle genes (Greene et al., 
1984; Scarlato and Gargano, 1992). The others are now identified as immediate-early genes (41-51 and 
53-60), delayed-early genes (27, 28, and 37-40), or middle gene (52), on the basis of expression time in 
wild-type infection and of the GP28-dependence of their expression. Gene 27 had characteristics of 
both delayed-early and middle genes. Genes 48-49 also showed substantial GP28-dependent expression 
at late times (data not shown). 

[44] For FIG 6, cells were harvested 15 minutes after infection at 30°C. RNA was 
prepared, labeled, hybridized to dot blots, and detected on film. The films were scanned using the 
UMAX® Vista Scan™ program with a UMAX® Astra 1220S™ scanner. A 25 X 25 pixel square 
surrounding each spot was copied and pasted into the grid displayed. 

EXAMPLE 1: ANTIMICROBIAL ACTIVITY 

[45] For analysis of antimicrobial activity we first cloned most of the 24 host-takeover 
genes (genes 37-60) singly or in multiple gene combinations into a B. subtilis/E. coli shuttle vector 
(pPW19) under control of an IPTG-inducible promoter (Wei and Stewart, 1993). Certain genes and 
gene combinations were not clonable using this vector system, suggesting that these genes are highly 
toxic, and that the small amount of leaky expression that occurs, even without induction, is sufficient to 
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kill the bacteria. These remaining genes are being cloned using expression vectors with more tightly 
regulated promoters, such as pJONEX4 (Sayers, 1996), pETlOl, or pX (Kim et a!., 1996). The 
transformed E. coli and B. subtilis were used to determine the antimicrobial activity, if any, of the 
cloned genes. The effect of induction on bacterial growth and viability was measured both in liquid 
medium and by colony formation. Definitive establishment of lethality was shown by loss of the ability 
to form colonies even after removal from the inducing agent. Indication of probable lethality was based 
on total inability to form colonies under inducing conditions. Data obtained to date are summarized in 
Table 2, and discussed in more detail below. 



Table 2. Summary of Lethality Data 


Gene 


E. coli 


B. subtilis 


Gene 


E. coli 


B. subtilis 


37 


Lethal * 


No data 


46 


Lethal 


Mildly Toxic 


38 


Highly toxic 


No effect ** 


45/46 


Highly Toxic 


Lethal 


39 


Lethal 


No data 


47 


No effect 


No effect 


38/39 


Lethal 


Lethal * 


48 


Strongly Toxic 


Strongly Toxic 


40 


Mildly Toxic 


Inconsistent 


49 


No data 


No data 


38/39/40 


Lethal * 


No data 


48/49 


Non-Toxic ** 


No data 


41 


Mildly Toxic 


Lethal * 


52 


Lethal 


Not tested 


42 


No effect 


No effect 


53/54/55 


Toxic 


Toxic 


43 


Indeterminate 


Indeterminate 


52/53/54/55 


Toxic 


Toxic 


44 


Lethal 


Lethal 


56 


Mildly toxic 


Lethal 


51 


Lethal 


Lethal 


57 


Mildly toxic 


No effect 


50/51 


Lethal 


Lethal 


58 


Mildly toxic 


No effect 


44/50/51 


Highly Lethal 


No data 


56/57/58 


Lethal 


Lethal 


45 


Toxic 


Mildly Toxic 


59/60 


Toxic ** 


Lethal ** 



* Lethality conclusion based on failure to clone, when appropriate controls cloned readily; **Data not conclusive. 



[46] Genes 37-40: Fragments carrying either of genes 37 or 39 could not be cloned in 
pPW19 in the expressed orientation, although each was cloned in the opposite orientation, suggesting 
that expression of each of these clones is lethal, even at the low level of expression without induction. 
Gene 39 was cloned in a more tightly controlled expression vector, and its expression was lethal. 

[47] Expression of gene 40 by itself was only mildly toxic to either E. coli or B. 
subtilis. However, a plasmid expressing the triplet combination 38/39/40 could not be cloned in E. coli, 
except in one case where spontaneous integration of insertion sequence IS-1 had inactivated gene 40. In 
this plasmid, expression of the remaining wild-type genes 38/39 was lethal to £. coli. The same plasmid 
could not be transformed into B. subtilis, suggesting that even the uninduced level of 38/39 expression is 
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lethal to B. subtilis. The fact that gene 40 had to be inactivated to permit cloning in E. coli suggests that 
GP40 reinforces the lethal activity of GP38 and GP39. 

[48] Mutational analysis in Example 2 (FIG. 4) shows that GP40 is essential for 
normal shutoff of host DNA synthesis, and that GP39 acts to restrain that activity of GP40. Mutational 
analysis (FIG. 3)also shows that both 39 and 40 are required for normal shutoff of host RNA and protein 
synthesis. The 38" mutation was also deficient in shutoff of host DNA, RNA, and protein synthesis. 
Mutants inactivating gene 37 have also been isolated, but results with them are thus far inconclusive. 

[49] Gene 41: Expression of gene 41 has only a mild effect on E. coli, but the plasmid 
carrying it could not be transformed into B. subtilis, suggesting that the low level of expression without 
induction is lethal to B. subtilis. The gene has apparently been cloned in B. subtilis, using a more tightly 
controlled vector, but experiments with that clone are thus far inconclusive. A mutation inactivating the 
gene is being isolated. 

[50] Genes 42 and 43: Gene 42 showed no toxic effect on either bacterium, and 
results on gene 43 were too inconsistent to permit any conclusion. 

[51 ] Gene 44: Expression of gene 44 was shown to be lethal to both B. subtilis and E. 
coli. FIG. 8 shows an example of killing of E. coli by expression of SPOl gene 44. In cells growing in 
liquid medium, expression of the cloned gene 44 was induced at time 0. At subsequent times, the 
number of cells remaining viable was measured by plating in the absence of inducer. More than 99% of 
the cells were killed within three hours. Similar experiments showed the lethal effect of gene 39, gene 
56 and two-gene operons 45/46 and 50/51. 

[52] Genes 44, 50, and 51: Expression of either gene 44 or 51 alone is lethal to either 
B. subtilis or E. coli, inhibiting DNA, RNA, and protein synthesis. Adding gene 50 to the 51 clone 
increases its lethal efficiency, as does combining all 3 genes in a single clone. An E. coli mutant 
resistant to GP44 has a mutant rpoB gene, suggesting that the cellular target for GP44 is the bacterial 
RNA polymerase. Mutants resistant to genes 51/50 have been isolated, but the mutant gene(s) have not 
yet been identified. GP51 and GP44 each include a segment similar to a known RNA polymerase- 
binding motif, suggesting that GP51 may also act on RNA polymerase, although the RNA polymerase 
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(RNAP) mutation that protects against GP44 does not protect against GPS 1/50. The effects on DNA 
and protein synthesis are probably indirect effects of the inhibition of host RNA synthesis. 

[53] We show that the products of SPOl genes 44, 50, and 51 are required for the 
normal transition from early to middle gene expression during infection of Bacillus subtilis by 
bacteriophage SPOl; that they are also required for control of the shutoff of host DNA, RNA, and 
protein synthesis; and that their effects on host-shutoff could be accounted for by their effects on the 
regulation of gene expression. These three gene products had four distinguishable effects in regulating 
SPOl gene expression: (1) GP44/50/51 acted to restrain expression of all SPOl genes tested; (2) GP44 
and/or GP50/51 caused additional specific repression of immediate-early genes; (3) GP44 and/or 
GP50/51 stimulated expression of middle genes; and (4) GP44 and/or GP50/51 stimulated expression of 
some delayed-early genes. Shutoff of immediate-early gene expression also required the activity of 
GP28, the middle gene-specific sigma factor. Shutoff of host RNA and protein synthesis was 
accelerated by either the 44- single mutant or the 50-51- double mutant, and more so by the 44-50-51- 
triple mutant. Shutoff of host DNA synthesis was accelerated by the mutants early in infection, but 
delayed by the 44-50-51 - triple mutant at later times. Although GP50 is a very small protein, consisting 
almost entirely of an apparent membrane-spanning domain, it contributed significantly to each activity 
tested. The combination clone of genes 44/50/51 is the most efficiently lethal construct tested thus far 
(0.06% survival after 5 minutes induction). 

[54] Genes 45 and 46: Expression of genes 45/46 is lethal to B. subtilis and highly 
toxic to E. coli. DNA, RNA, and protein synthesis are all affected about equally, showing a decrease of 
about 50% in the first hour, and no further decrease in the next 2 hours. No information is available yet 
on the cellular target, but resistant mutants should be readily obtainable. 

[55] Individually, gene 46 is lethal but with diminished efficiency, and gene 45 is toxic 
but not lethal. Early results suggest that, during infection, a double mutant knocking out both genes is 
deficient in shutoff of host RNA and protein synthesis, but has no effect on host DNA synthesis. 

[56] Gene 47: Gene 47 has little or no effect when expressed in either B. subtilis or E, 

coli. 
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[57] Genes 48 and 49: Gene 48 is strongly toxic but not lethal when expressed in 
either species. Although anomalies exist, the combination of genes 48 and 49 appears to be less toxic 
than 48 alone. 

[58] Genes 55-52: Expression of gene 52 is lethal to E. coli. Wild-type gene 52 has 
not yet been tested on B. subtilis. Expression of a clone with genes 55/54/53 is toxic, but not lethal. In a 
single experiment, the latter inhibited bacterial protein synthesis without affecting RNA synthesis. 
Expression of the full operon is toxic, but not consistently lethal. This is apparently accounted for by a 
transcription terminator located between genes 53 and 52. Mutants resistant to GP52 have been 
isolated. The mutant gene(s) have not yet been identified, but while looking for them, several wild-type 
DNA segments whose over-expression protects against GP52 have been isolated. One of them includes 
a cluster of genes whose products are involved with protein synthesis, adding another suggestion that the 
gene 55 to 52 operon may target host protein synthesis. Gene 52 and gene 53 mutants are presently 
being isolated. 

[59] Genes 58-56: Expression of the whole operon is lethal to both species. 
Expression of 56 alone is lethal to B. subtilis, specifically inhibiting cell division, without affecting 
DNA synthesis or chromosomal segregation. Infection by SPOl inhibits cell division, and a gene 56 
mutation prevents that inhibition. B. subtilis mutants resistant to GP56 have been isolated, but the 
mutant gene(s) has not yet been identified. 

[60] Genes 60-59: The effects of GP59 and GP60 are uncertain because of apparent 
instability of the clone and inconsistency of data, but it appears that their expression may be lethal to B. 
subtilis in liquid culture but not on plates, while expression in E. coli is toxic, but not lethal. 

[61] Since some of these genes can kill both gram-positive (B. subtilis) and gram- 
negative (E. coli) bacteria, they would probably be lethal to most bacterial pathogens. The first three of 
the lethal mechanisms studied targeted at least three different bacterial processes (RNA synthesis for 
gene 44, cell division for gene 56, and both DNA synthesis and RNA synthesis for genes 38, 39, and 
40). The lethal mechanisms studied target a panoply of bacterial processes and are thus ideally suited 
for the development of antimicrobial agents. 



HOUSTON 306348V1 61683-00004USPT 



15 



Docket No. : 61 683-00004USPT 

EXAMPLE 2: MUTATIONAL ANALYSIS OF PROTEIN ROLE 

[62] We have performed mutational analysis in order to determine what the roles of 
the 24 genes and their proteins are in the host takeover by SPOl . 

[63] Regulatory roles of genes 44, 50, and 51: By site-specific mutagenesis, we have 
introduced a nonsense codon near the beginning of each of these genes, and have constructed all 
possible combinations of mutant genes. The nonsense mutations accelerated the rate of host-shutoff 
FIG. 1 shows that shutoff of host RNA synthesis was accelerated by either the 44- single mutant or the 
50-51- double mutant, and more so by the 44-50-51- triple mutant. Each data point in such an 
experiment represents the sum of host and phage RNA synthesis, so the decrease in total RNA synthesis 
by 12 minutes after infection provides a minimum estimate of the extent to which host RNA synthesis 
had been shut off. For wild-type and 44-50-51- these values were about 50% and 95%, respectively. 
The subsequent increases are due to increased phage RNA synthesis. Single mutants 50- and 51 -had no 
significant effect (data not shown), showing that both GP50 and GP51 contribute to the activity of 
GP50/5L 

[64] Each of the three mutations contributed to this effect. 44- and 50-51- each shut off 
host RNA synthesis more completely than wild-type, but less completely than the triple mutant. FIG. 7 
shows that, while each of the 50- and 51- mutations had little effect by itself, each was necessary for the 
full effect of the triple mutant, 44-50-51-, and the double mutant, 50-51- (FIG. 7a and 7b) 

[65] Similar results were seen when shutoff of host protein synthesis was measured by 
pulse-labeling with [ 3 H]leucine, or by SDS-polyacrylamide gel electrophoresis of proteins pulse-labeled 
with [ 35 S]-labeled amino acids. Thus, GP44/50/51 restrained the normal shutoff of host gene 
expression, and each of genes 44, 50, and 5 1 contributed significantly to that activity. 

[66] FIG. 2 shows that shutoff of host DNA synthesis was accelerated by the mutants 
early in infection, but delayed by the 44-50-51- triple mutant at later times. Only host DNA synthesis is 
assayed in such an experiment, since SPOl incorporates hmUra into its DNA in place of thymine. 
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[67] Thus, GP44/50/51 delayed the shutoff of host RNA synthesis, and both GP44 and 
GP50/51 contributed significantly to that activity. GP44/50/51 delayed the early component of the 
shutoff of host DNA synthesis, while a later component required the activity of GP44 and/or GP50/51 . 

[68] Independent Role of GP50: GP50 includes only 23 amino acids, and consists 
almost entirely of an apparent membrane-spanning domain, with charged amino acids at each end. It is 
important to demonstrate that such an unusual protein actually plays a functional role. The first 8 
nucleotides of gene 50 overlap the last 8 nucleotides of gene 51, and the mutation that inactivates gene 
50 also destroys the termination codon of gene 51, permitting GP51 to run on for about 30 more amino 
acids, beyond its normal 131. Thus, demonstration of an independent role for GP50 depends on the 
effects of adding the gene 50 mutation to strains that already have gene 51 inactivated. This is shown in 
FIG. 7 and FIG. 12. 

[69] Regulation of SPOl gene expression. The negative effects of GP44/50/51 on 
shutoff of host RNA synthesis, and on the early phase of shutoff of host DNA synthesis, suggest that 
GP44/50/51 plays a regulatory role, rather than a directly causative role, in host-shutoff We believe 
that we have identified the nature of that regulatory role, in that GP44/50/51 regulate the expression of 
all of the SPOl genes known or expected to be involved in host-shutoff as shown in FIGs. 5 and 6. FIG. 
5 shows that the triple mutant decreased transcription of at least some middle genes, while dramatically 
increasing transcription of all immediate-early genes. Thus the most obvious effect of GP44/50/51 is to 
facilitate the transition from immediate-early to middle gene expression. 

[70] However, when the data from 44- and 50-51- are considered, it's clear that these 
three gene products cause at least 4 distinguishable effects, which may be seen in FIG. 6. The 
immediate-early genes shown are representative of all known immediate-early genes. All of the known 
delayed early genes are shown, and the middle genes shown were the only ones assayed. Gene 27 is 
expressed from both an early and a middle promoter. 

[71] The first effect is the repression of all early and middle genes, which requires the 
activity of both GP44 and GP50/51. In comparison with wild-type, either the 44- mutation or the 50-51- 
double mutant caused increased expression of all genes tested, including immediate-early, delayed- 
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early, and middle genes. Therefore, GP44/50/51 caused repression of all of the SPOl genes tested, and 
the full effect of this repression required the activity of both GP44 and GP50/51. 

[72] The second effect is residual repression of immediate-early genes by either GP44 
or GP50/5 1 , even in the absence of the other. In comparison with 44- or 50-5 1-, the triple mutant 44-50- 
51- caused a further increase in expression of immediate-early genes. Therefore, there must be residual 
repressive activity present in the 44- or 50-51- strains, which is no longer there in 44-50-51-, and which 
therefore must be caused by GP50/51 in the 44- strain and by GP44 in the 50-51- strain. 

[73] The third effect is stimulation of expression of middle genes by either GP44 or 
GP50/51. In comparison with 44- or 50-51-, the triple mutant caused a decrease in expression of middle 
genes. Therefore, there must be an activity present in the 44- or 50-51- strains, which stimulates middle 
gene expression. Since diat activity is no longer there in 44-50-5 1-, it must be caused by GP50/5 1 in the 
44- strain and by GP44 in the 50-5 1- strain. 

[74] The fourth effect is stimulation of expression of certain delayed-early genes by 
either GP44 or GP50/51. In comparison with 44- or 50-51-, the triple mutant caused a decrease in 
expression of delayed-early genes 27 and 28. Therefore, there must be an activity present in the 44- or 
50-51- strains, which stimulates delayed-early gene expression. Since that activity is no longer there in 
44-50-51-, it must be caused by GP50/51 in the 44- strain and by GP44 in the 50-51- strain. With 
respect to delayed-early genes 37-40, such an activity can be seen for GP44 in the 50-51- strain, but not 
conclusively for GP50/51 in the 44- strain. 

[75] These effects of GP44/50/51 on regulation of expression of early and middle 
genes provide a plausible explanation for the effects of GP44/50/51 on host shutoff. The effect on 
shutoff of host RNA synthesis can be explained if one or more of the immediate-early genes, regulated 
by GP44/50/51 , specifies a product that causes shutoff of host RNA synthesis. The over expression of 
such genes, caused by the triple mutant, could cause the accelerated shutoff of host RNA synthesis that 
was observed in triple mutant infection. The intermediate levels of over expression, caused by mutants 
44- and 50-51-, could account for their also causing intermediate levels of accelerated shutoff of host 
RNA synthesis. Similarly, the deficient shutoff of host DNA synthesis by the triple mutant at middle 
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times could be explained if that shutoff required one or more of the middle gene products whose 
expression is deficient in the triple mutant. 

[76] Alternative explanations, in which GP44/50/51 have activities, in addition to their 
regulation of SPOl gene expression, which directly affect host shutoff, remain possible, although 
intuitively less likely because of their additional complexity. A possible explanation of the second and 
third effects is that GP44 or GP50/51 facilitates the displacement of a A by GP28, but that is complicated 
by the fact that transcription of most host genes also requires a A . Under some conditions, rRNA 
synthesis is not shut off effectively by SPOl. Thus, a possible role of GP44/50/51 might be to prevent 
shutoff of rRNA synthesis, thereby accounting for some of the effect of 44-50-51- on RNA shutoff. All 
of these are potential subjects of future investigations. 

[77] The simplest hypothesis as to how GP44/50/51 regulate gene expression is that 
they interact with the host's RNAP to change its transcriptional specificity. Our earlier results had 
already suggested that RNAP was the cellular target for these proteins. When gene 44 was expressed in 
uninfected cells of either B. subtilis or E. coli, it caused the shutoff of bacterial RNA synthesis and cell 
death (Wei and Stewart, 1993, 1995). DNA and protein synthesis were also shut off, presumably as 
indirect results of the effect on RNA synthesis. An E. coli mutation substituting valine for glutamate at 
position 1272 of the beta subunit of the RNAP provided resistance to the lethal effects of gene 44 
(Sampath, unpublished results). When the gene 50/51 operon was expressed in uninfected cells, it also 
caused the shutoff of RNA, DNA, and protein synthesis and cell death (our unpublished results). The 
products of both genes 44 and 51 include acidic/hydrophobic domains similar to the domain in E. coli 
a 54 that is required for binding to RNAP, and one segment of GP51 shows substantial similarity to the 
product of SPOl gene 27, which is required for normal transcription of late genes (Greene et aL, 1982; 
Stewart, 1993). Because of the very small size of GP50 (23 amino acids), its presence in the same 
operon with, and overlapping gene 51, and the synergistic effects of mutations in genes 50 and 51 we 
hypothesize that GP50 participates in the same activities as GP51. We suppose that the observed effects 
of GP44 or GP50/51 in uninfected cells are incidental consequences of their binding to RNAP, which 
causes effects in the opposite direction when in the context of phage infection. 
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[78] Causative roles of genes 37, 38, 39, and 40 in host-shutoff. Nonsense 
mutations were introduced into eaeh of genes 37 to 40 as described above. FIG. 3 shows that both the 
40- and the 39- mutants are deficient in shutoff of host RNA synthesis, that 40- has the greater effect, 
and that the effects of the two mutations are not additive. These mutants had similar effects on shutoff 
of host protein synthesis (data not shown). However, as shown in FIG. 4, 40- inhibited but 39- 
accelerated the shutoff of host DNA synthesis, and 40- suppressed this effect of 39-. Apparently GP39 
restrains the effect of GP40 on shutoff of host DNA synthesis, while both work in the same direction on 
host RNA synthes 1S . Both shutoffs occur so rapidly that it's unlikely that either effect of the 40- 
mutation is an indirect effect of the other. Genes 37-40 comprise one operon, with 38 and 39 
overlapping slightly, suggesting the possibility of translation* coupling and joint activity (Stewart et al, 
1998). Preliminary results are consistent with GP37 and GP38 also participating in the same shutoffs. 
Thus, GP40 is required for normal shutoff of host DNA and RNA synthesis, and each of the other genes 
in this operon does or may play a significant role in each of those shutoffs. 

[79] Onset of Phage DNA Synthesis: FIG. 9 shows the effect of various of the mutant 
combinations on the onset of phage DNA synthesis. Each mutant combination that caused substantial 
delay in shutoff of host DNA synthesis also caused delay in phage DNA synthesis. This is probably 
because the continued use of cellular resources for host DNA synthesis meant that they were not fully 
available for phage DNA synthesis. Also contributing in most cases would be the diminished activity of 
phage genes required for phage DNA replication. 

[80] Single Step Growth Curves: FIG. 10 and 11 show the effects of the various 
mutations on single step growth curves. Despite the profound effect of several of the multiple mutants 
on phage gene expression and DNA replication, production of progeny phage was only moderately 
affected. Each mutant that caused significant delay in phage DNA synthesis also caused a significant 
extension of the latent period. Only those mutants with a deficiency in phage DNA synthesis caused 
decreases in burst size. These were less than proportional to the decreases in DNA synthesis, suggesting 
that phage DNA is produced in excess in wild-type infections under these conditions. 
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EXAMPLE 3: CONTINUED ANALYSIS 

[8 1 ] For those genes whose lethality is not yet certain, we will determine the effects of 
gene expression unequivocally by cloning in more tightly controlled expression vectors. For each of 
these lethal genes and gene combinations, we will attempt to determine the mechanism by which it 
exerts its lethal effect. 

[82] To determine the cellular targets of the lethal proteins, we will select bacterial 
mutants resistant to each of the lethal genes, by plating, under inducing conditions, the strain carrying 
the lethal gene, and identifying those colonies whose resistance is caused by a chromosomal mutation. 
Each mutant gene will be identified, by screening a library of the chromosomal DNA for capacity to 
protect against the lethal effect of the phage gene. The methodology used will be similar to that 
described in Wei and Stewart (1995). Mutants resistant to 4 of the lethal genes have already been 
isolated, and in this manner the gene 44- resistant mutation was determined to be in the rpoB gene, 
specifying the B subunit of the RNA polymerase. 

[83] In case the protective mutation cannot be located by such a library screen (for 
instance because the protective mutation proves to be recessive, or because the protective gene proves to 
be unclonable), we will locate the protective mutation by genetic mapping. For E. coli, we will 
determine the approximate location by co-transfer with the TnlO markers of the set of Hfr strains 
developed by Wanner (1986) and available from the £. coli Genetic Stock Center, and then determine 
the location more precisely by degree of linkage in PI transduction (procedures as in Miller, 1992). For 
B. subtilis, we will determine the approximate location by showing linkage in PBS1 transduction with 
markers represented in the Dedonder et al (1977) or Zahler (Vandeyar and Zahler, 1986) mapping kits, 
available from the Bacillus Genetic Stock Center. For finer structure mapping, we will look for the 
degree of co-transformation with markers within the region that showed high frequency co-transduction. 
The procedures are modifications of those described in Harwood and Cutting (1990). The availability of 
a complete set of mutants inactivating each gene in the 5. subtilis genome (Kobayashi et al, 2003) will 
permit precise location of each affected gene, which will then be confirmed by nucleotide sequencing. 

[84] Identification of protein iprotein interactions: To confirm the interaction of the 
lethal proteins with the targets thus tentatively identified, and to identify targets not found by such 
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mutational analysis, host proteins that interact with each lethal SPOl protein will be identified. We will 
make use of several genetic and physical techniques for identifying protein-protein interactions. Since 
such techniques can produce both false positives and false negatives (Legrain, 2002), we will use two or 
more of these techniques with each of the proteins. These techniques include: 

[85] Identification of cellular targets by genetic selection: Since these SPOl 
proteins interfere with essential host processes, many of them are lethal when expressed individually in 
uninfected bacteria. Thus, their cellular targets may be identified by selecting bacterial mutants resistant 
to their lethal effects. Most of the proteins have similar effects on E. coli and B. subtilis (which share 
substantial homology), permitting the use of the more versatile E. coli cloning systems for initial 
identification of certain cellular targets. Targets initially identified in E. coli will be confirmed by 
testing the effect of the B. subtilis homolog. 

[86] E. coli mutants resistant to the lethal effect of GP44 were selected by plating 
under inducing conditions. One mutated gene identified was the rpoB gene (which specifies the beta 
subunit of RNAP), by screening a genomic library for capacity to protect against the lethal effect (Wei 
and Stewart, 1995; Sampath, unpublished results). Although alternative explanations are possible, this 
strongly suggests that GP44 targets the host RNAP. Similar analyses are in progress with others of the 
SPOl lethal genes. 

[87] Pull-down assays. Proteins to be tested will either be expressed as glutathione-S- 
transferase (GST) fusions for attachment to glutathione sepharose beads (Pharmacia) or directly attached 
to N-hydroxysuccinimide (NHS)-activated sepharose resin (Pharmacia). The coupling of the NHS to 
primary amine is done under non-denaturing conditions at room temperature, and is quite efficient. 
Extracts of either infected or uninfected B, subtilis or E. coli cultures will be passed over the column, 
and the retained proteins will be eluted using increasing salt concentrations or urea denaturation. The 
eluant will then be subjected to SDS-PAGE, and the proteins electroeluted onto PVDF membranes for 
N- terminal sequencing and identification (Einarson and Orlinick, 2002). 

[88] Phage Display. Because only a few amino acids from one protein are needed to 
provide effective binding to another protein (Scott, 2001), it is not necessary to probe a library of whole 
proteins to identify binding partners. A phage display library (Willats, 2002; Barbas et aL, 2001) 
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displaying random sequence peptides 5 to 7 amino acids long will include most of the sequences 
necessary to confer specificity of binding to any protein. When a single protein is used to probe such a 
library, it can potentially identify the binding sequences of most or all proteins capable of binding to the 
probe protein. For instance, Malys et al (2002) probed a random sequence peptide library with the T4 
terminase protein, and identified all proteins previously known to bind to that protein, plus five other 
proteins whose binding was not previously known, but was subsequently confirmed in the one case 
conclusively tested. 

[89] Each of the proteins will be used to probe a random sequence peptide library, 
displayed on either phage T4 or phage Ml 3. Several Ml 3 libraries are available from New England 
Biolabs and have been used successfully in a long list of experiments (NEB, 2003). The peptide 
sequences identified as binding to each of the probe proteins will be used to probe the database of all B. 
subtilis and SPOl gene products, thus identifying those proteins having segments capable of binding to 
the probe protein. The B. subtilis and E. coli genomes are completely known, and the virtually 
complete SPOl sequence is also available (Hendrix et al, 2003). 

[90] Immunoprecipitation. Antibodies will be prepared against each of the proteins. 
Each antibody will be used to precipitate the targeted protein from extracts of infected cells, following 
procedures as described (Wei and Stewart, 1993; Wei and Stewart, 1995). The proteins in the 
precipitate will be identified using the same procedure as described. 

[91] Two-hybrid analysis. We anticipate that the above procedures will be sufficient 
to identify all or most of the proteins that interact with any of the lethal SPOl proteins. However, if 
further information is needed, we will make use of a two-hybrid system. Noirot-Gros et al. (2002) 
described the use of yeast two-hybrid screens to identify many interactions between B. subtilis proteins, 
and this procedure can readily be adapted to interactions between SPOl proteins and bacterial proteins. 

EXAMPLE 4: NONLYTIC PHAGE THERAPY 

[92] Certain pathogens (e.g. E. coli 0157:H7 or Neisseria meningitidis) cannot be 
treated with lytic phage or conventional antibiotics, because rapid destruction of the bacteria causes the 
precipitate release of lethal toxins that can kill the patient. The antimicrobial genes and peptides 
described herein solve this problem because each of them kills without lysing the bacterial cell. This has 
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been documented directly for several of the genes by showing either microscopically or 
turbidimetrically that the structure of the cells remains intact even after they have lost viability. 
Maintenance of structural integrity can also be inferred for almost all of them, since all but gene 52 are 
expressed early during infection and expression of all early genes is insufficient to cause lysis during 
infection. Thus, the proteins could be used to treat such infections directly, or a temperate 
bacteriophage could be used to target the genes to the bacteria. 

[93] A transducing bacteriophage, specific for the particular pathogen, would 
introduce the lethal SPOl gene(s), either on a plasmid that replicates in that bacterium or as part of a 
structure that would integrate into the bacterial genome. Exponential Biotherapies Inc. has successfully 
completed Phase 1 clinical trials of one of their first-generation phage therapy vehicles, a lytic phage 
that destroys pathogenic Enierococcus bacteria. They have patents on a method that selects for phages 
that survive in the circulation long enough to rescue animals from otherwise-lethal bacteremias 
(US2001 0026795 and US2001 0043924). The antimicrobial SPOl genes would be used in the second 
generation of phage therapy vehicles, in which individual lethal genes or gene combinations would be 
delivered to the infecting bacteria by a transducing phage. 

EXAMPLE 5: PEPTIDOMIMETICS 

[94] To be effective therapeutically, the antibiotic activities characteristic of these 
various lethal proteins must be delivered to the infecting bacterial pathogen. This could be 
accomplished in any of several ways, one of which is to design small molecule drugs that mimic the 
active sites of the proteins. To be effective, the antibiotic characteristic of various lethal proteins must 
be delivered to the infecting bacterial pathogen, in concentrations high enough to be therapeutic. By 
designing small molecule drugs that mimic the active sites of the lethal proteins, but that have favorable 
pharmacological characteristics with regard to delivery, uptake, human toxicity, and potency, a directed 
peptidomimetic molecule can be designed. Examples of successful use of such peptidomimetic small 
molecules include: Tian et al, 1998; Salvemini et al, 1999, 2002; Andrade-Gordon et al., , 1999; 
Wrighton and Gearing, 1999; Zhang, 2000; Ohkanda et al, 2001; Smith et al, 2001; Air et al, 2002; 
Gadek et al, 2002; Fotsch et al, 2003; and Martin et al, 2003. 
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[95] Structures of the lethal proteins will be determined. The lethal SPOl proteins are 
relatively small (ranging from 77 to 255 amino acids) so we can determine their structures by X-ray 
crystallography and/or NMR spectroscopy. Each functional site will be identified by 1) mutagenesis of 
the cloned gene, 2) selection for mutants that have lost their lethal activity, and 3) identification of the 
positions of the mutant amino acids in the 3-dimensional structures. Similarly, the structures of the 
target molecules will be determined, and the targeted surfaces identified by the positions of mutations 
causing resistance to the lethal proteins. Where possible, we will co-crystallize the lethal protein with 
the target molecule, to determine changes in structure caused by their interaction. 

[96] Small molecules can then be designed that attempt to mimic the structure of the 
active site. High throughput screening of many candidate molecules for binding to the target protein or 
inhibition of its function, depending on the specific target identified, will provide lead molecules for 
peptidomimetic chemicals. 

[97] Preparation for X-ray crystallography. The lethal SPOT proteins are relatively 
small (ranging from 77 to 255 amino acids) and their origins in a prokaryotic host make it likely that the 
proteins will be correctly folded on expression. Each of the purified proteins will be tested by light 
scattering and native gel electrophoresis to determine whether the samples are monodisperse. Most 
proteins will be tested for proper folding, by taking their Heteronuclear-Single Quantum Correlation 
(HSQC) spectra, using either the 500 or 600 MHz NMR at Rice University. To do this, the proteins will 
be ,5 N-labeled by growing cells in minimal media supplemented with ^N-ammonium chloride. Proteins 
that are partially unfolded can be rapidly identified by 15 N-HSQC NMR spectra, and the salt, pH and 
ligand conditions can be adjusted to provide a homogeneous, well-folded sample suitable for X-ray 
diffraction studies. Proteins that are well-folded and active will then be subjected to a sparse-matrix 
crystallization protocol to identify conditions appropriate for macromolecular crystal growth. The 
smaller proteins such as GP46 and GP56 would also be amenable to structure determination by NMR. 
They will be isotopically labeled with ,5 N or 15 N and 13 C and their structures determined by 
multidimensional heteronuclear NMR methods. In cases where a toxic protein is expressed in limited 
quantities, we will skip the NMR screening and proceed directly to crystallography. 
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[98] High-throughput crystallographic screening. The Rice University 
Crystallography Core Facility uses a Matrix Technologies® Hydra II-Plus One™ liquid-handling 
robot and automated imaging microscope, for high-throughput crystallographic screening (Diversified 
Scientific). Such facilities give us the ability to rapidly screen far more crystallographic conditions with 
far less material than traditional vapor diffusion methods. This is particularly useful for proteins whose 
yield has been limited due to their toxic nature. Using our high-throughput facilities, 1 mg of protein at 
10 mg/ml can be subjected to 500 trial conditions, as compared to 100 using manual methods. Analysis 
is facilitated by construction of relational databases" for correlation of high-throughput crystallization 
data with incomplete-factorial sparse matrix generation and development of new second generation 
matrices derived from the observed trials Recent studies from the NIH structural genomics initiative 
have suggested that in 90% of cases of a monodisperse and well-folded protein where a crystallization 
condition is found, the condition was found in only -200-500 trials. Failure to obtain crystals with good 
diffraction properties after 500 trials suggests that either addition of a ligand/cofactor or partial 
proteolysis studies may be appropriate for further trials. Once crystallization conditions have been 
identified, structures will be determined using either Multi-wavelength Anomalous Diffraction (MAD) 
(Hedrickson and Ogata, 1997) or Multiple lsomorphous Replacement (MIR) (McPherson, 1999). 
Model building and refinement (Shamoo et ai, 1995, Shamoo and Steitz, 1999) provides additional 
information from small sample sizes and allows generation of ideal peptidomimetic structures. 

[99] Mutagenesis. To map the functional site of each of the lethal proteins, genes 
encoding the lethal proteins will be subjected to error prone PCR in vitro (Spee et ai, 1993; Miyazaki et 
ai, 2000) and then reintroduced back into either E. coli or B. subtilis by transformation. Mutations that 
strongly reduce or abolish host lethality will be selected by survival of the host to form colonies. 
Sequencing of the mutants will permit mapping the changes onto the protein's primary structure, and 
thus onto the three-dimensional structure as determined above, hopefully providing a tentative 
identification of the active site. 

[ 1 00] Structure of target molecules and target sites. It should be possible to identify 
the target site within the target molecule, by identifying the amino acids whose mutation protects the 
target against the activity of the lethal protein, and finding which of those amino acids in the wild-type 
protein form a surface complementary to the binding surface of the lethal protein. Thus, when a target 
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has been identified, the wild-type gene specifying it will be cloned and subjected to mutagenesis by 
error prone PCR. The library consisting of that mutagenized population will be transformed back into 
the host cells, selecting for mutants resistant to the lethal protein. The position of the mutations will be 
determined by nucleotide sequencing, and those positions mapped onto the 3-dimensional structure. In 
cases where the target structure is not known, we will determine it whenever possible. Such analysis will 
allow us to determine a possible mechanism of action; for example resistant mutations may map to a 
known protein-protein contact surface or active site. We can then develop an accurate structure-function 
characterization of the host-S POl protein interaction and mechanism. 

[101] To minimize the likelihood of toxic side effects when the peptidomimetic 
molecules are introduced into clinical use, our highest priorities for the above analyses will be those of 
the lethal SPOl proteins that do not target a protein with an obvious human homolog. However, even 
those whose target does have a human homolog might be worth pursuing, since the target may function 
differently or be less accessible in human cells. 

EXAMPLE 6: ANTIMICROBIAL TREATMENTS 

[102] Information already in hand about some of these lethal proteins shows that they 
target a variety of different bacterial processes, so we expect to develop a panoply of different antibiotic 
mechanisms, each targeting a different bacterial process. Since none of the lethal gene products show 
substantial homology to any known protein, it is highly probable that each mechanism will be novel, 
meaning that bacteria resistant to currently used antibiotics should be susceptible to each of the 
antibiotics based on these lethal proteins. 

[103] The antibiotics to be developed would have the following advantages: (1) 
targeting of a wide spectrum of gram-positive and gram-negative bacterial pathogens; (2) effectiveness 
against anthrax, plague, and other anticipated agents of biological warfare; (3) effectiveness against 
bacteria that are resistant to presently known antibiotics; (4) inclusion of a diverse spectrum of antibiotic 
mechanisms; and (5) minimal release of lethal toxins during treatment. 

[104] Combining two or more lethal genes in the same clone can form more efficiently 
lethal constructs. For instance, the triplet combination of genes 44, 50, and 51 kills more than 20 times 
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as efficiently as either of its components cloned individually in the same vector (0.06% survival after 5 
minutes induction). 

[105] Since few, if any, existing antibiotics are based on a specific bacteriophage gene, 
and since few, if any, of the lethal SPOl gene products show substantial homology to any protein 
presently in the public database, it is likely that each lethal mechanism will be novel. This should help 
avoid the resistance mechanisms of the current generation of resistant bacteria. 
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